智能论文笔记

Machine Learning Approaches to Predict Breast Cancer: Bangladesh Perspective

Taminul Islam , Arindom Kundu , Nazmul Islam Khan , Choyon Chandra Bonik , Flora Akter , Md Jihadul Islam

分类：机器学习

2022-06-30

如今，乳腺癌已成为近年来最突出的死亡原因之一。在所有恶性肿瘤中，这是全球妇女最常见和主要的死亡原因。手动诊断这种疾病需要大量的时间和专业知识。乳腺癌的检测是耗时的，并且可以通过开发基于机器的乳腺癌预测来减少疾病的传播。在机器学习中，系统可以从先前的实例中学习，并使用各种统计，概率和优化方法从嘈杂或复杂的数据集中找到难以检测的模式。这项工作比较了几种机器学习算法的分类准确性，精度，灵敏度和新近收集的数据集的特异性。在这种工作决策树，随机森林，逻辑回归，天真的贝叶斯和XGBoost中，已经实施了这五种机器学习方法，以在我们的数据集中获得最佳性能。这项研究的重点是找到最佳的算法，该算法可以预测乳腺癌，以最高的准确性。这项工作在效率和有效性方面评估了每种算法数据分类的质量。并与该领域的其他已发表工作相比。实施模型后，本研究达到了最佳模型准确性，在随机森林和XGBoost上达到94％。

translated by 谷歌翻译

Detecting Change Intervals with Isolation Distributional Kernel

Yang Cao , Ye Zhu , Kai Ming Ting , Flora D. Salim , Hong Xian Li , Gang Li

分类：机器学习

2022-12-30

Detecting abrupt changes in data distribution is one of the most significant tasks in streaming data analysis. Although many unsupervised Change-Point Detection (CPD) methods have been proposed recently to identify those changes, they still suffer from missing subtle changes, poor scalability, or/and sensitive to noise points. To meet these challenges, we are the first to generalise the CPD problem as a special case of the Change-Interval Detection (CID) problem. Then we propose a CID method, named iCID, based on a recent Isolation Distributional Kernel (IDK). iCID identifies the change interval if there is a high dissimilarity score between two non-homogeneous temporal adjacent intervals. The data-dependent property and finite feature map of IDK enabled iCID to efficiently identify various types of change points in data streams with the tolerance of noise points. Moreover, the proposed online and offline versions of iCID have the ability to optimise key parameter settings. The effectiveness and efficiency of iCID have been systematically verified on both synthetic and real-world datasets.

translated by 谷歌翻译

Automated Level Crossing System: A Computer Vision Based Approach with Raspberry Pi Microcontroller

Rafid Umayer Murshed , Sandip Kollol Dhruba , Md. Tawheedul Islam Bhuian , Mst. Rumi Akter

分类：计算机视觉

2022-12-08

In a rapidly flourishing country like Bangladesh, accidents in unmanned level crossings are increasing daily. This study presents a deep learning-based approach for automating level crossing junctions, ensuring maximum safety. Here, we develop a fully automated technique using computer vision on a microcontroller that will reduce and eliminate level-crossing deaths and accidents. A Raspberry Pi microcontroller detects impending trains using computer vision on live video, and the intersection is closed until the incoming train passes unimpeded. Live video activity recognition and object detection algorithms scan the junction 24/7. Self-regulating microcontrollers control the entire process. When persistent unauthorized activity is identified, authorities, such as police and fire brigade, are notified via automated messages and notifications. The microcontroller evaluates live rail-track data, and arrival and departure times to anticipate ETAs, train position, velocity, and track problems to avoid head-on collisions. This proposed scheme reduces level crossing accidents and fatalities at a lower cost than current market solutions. Index Terms: Deep Learning, Microcontroller, Object Detection, Railway Crossing, Raspberry Pi

translated by 谷歌翻译

CrossPyramid: Neural Ordinary Differential Equations Architecture for Partially-observed Time-series

Futoon M. Abushaqra , Hao Xue , Yongli Ren , Flora D. Salim

分类：机器学习

2022-12-07

Ordinary Differential Equations (ODE)-based models have become popular foundation models to solve many time-series problems. Combining neural ODEs with traditional RNN models has provided the best representation for irregular time series. However, ODE-based models require the trajectory of hidden states to be defined based on the initial observed value or the last available observation. This fact raises questions about how long the generated hidden state is sufficient and whether it is effective when long sequences are used instead of the typically used shorter sequences. In this article, we introduce CrossPyramid, a novel ODE-based model that aims to enhance the generalizability of sequences representation. CrossPyramid does not rely only on the hidden state from the last observed value; it also considers ODE latent representations learned from other samples. The main idea of our proposed model is to define the hidden state for the unobserved values based on the non-linear correlation between samples. Accordingly, CrossPyramid is built with three distinctive parts: (1) ODE Auto-Encoder to learn the best data representation. (2) Pyramidal attention method to categorize the learned representations (hidden state) based on the relationship characteristics between samples. (3) Cross-level ODE-RNN to integrate the previously learned information and provide the final latent state for each sample. Through extensive experiments on partially-observed synthetic and real-world datasets, we show that the proposed architecture can effectively model the long gaps in intermittent series and outperforms state-of-the-art approaches. The results show an average improvement of 10\% on univariate and multivariate datasets for both forecasting and classification tasks.

translated by 谷歌翻译

Integrated Convolutional and Recurrent Neural Networks for Health Risk Prediction using Patient Journey Data with Many Missing Values

Yuxi Liu , Shaowen Qin , Antonio Jimeno Yepes , Wei Shao , Zhenhao Zhang , Flora D. Salim

分类：机器学习

2022-11-11

Predicting the health risks of patients using Electronic Health Records (EHR) has attracted considerable attention in recent years, especially with the development of deep learning techniques. Health risk refers to the probability of the occurrence of a specific health outcome for a specific patient. The predicted risks can be used to support decision-making by healthcare professionals. EHRs are structured patient journey data. Each patient journey contains a chronological set of clinical events, and within each clinical event, there is a set of clinical/medical activities. Due to variations of patient conditions and treatment needs, EHR patient journey data has an inherently high degree of missingness that contains important information affecting relationships among variables, including time. Existing deep learning-based models generate imputed values for missing values when learning the relationships. However, imputed data in EHR patient journey data may distort the clinical meaning of the original EHR patient journey data, resulting in classification bias. This paper proposes a novel end-to-end approach to modeling EHR patient journey data with Integrated Convolutional and Recurrent Neural Networks. Our model can capture both long- and short-term temporal patterns within each patient journey and effectively handle the high degree of missingness in EHR data without any imputation data generation. Extensive experimental results using the proposed model on two real-world datasets demonstrate robust performance as well as superior prediction accuracy compared to existing state-of-the-art imputation-based prediction methods.

translated by 谷歌翻译

Leveraging Language Foundation Models for Human Mobility Forecasting

Hao Xue , Bhanu Prakash Voutharoj , Flora D. Salim

分类：机器学习 | 人工智能

2022-09-11

在本文中，我们提出了一条新型的管道，该管道利用语言基础模型进行时间顺序模式挖掘，例如人类的移动性预测任务。例如，在预测利益（POI）客户流量的任务中，通常从历史日志中提取访问次数，并且仅使用数值数据来预测访客流。在这项研究中，我们直接对包含各种信息的自然语言输入执行预测任务，例如数值和上下文的语义信息。引入特定的提示以将数值时间序列转换为句子，以便可以直接应用现有的语言模型。我们设计了一个Auxmoblcast管道，用于预测每个POI中的访问者数量，将辅助POI类别分类任务与编码器架构结构集成在一起。这项研究提供了所提出的Auxmoblcast管道有效性以发现移动性预测任务中的顺序模式的经验证据。在三个现实世界数据集上评估的结果表明，预训练的语言基础模型在预测时间序列中也具有良好的性能。这项研究可以提供有远见的见解，并为预测人类流动性提供新的研究方向。

translated by 谷歌翻译

Integrating Knowledge Graph embedding and pretrained Language Models in Hypercomplex Spaces

Mojtaba Nayyeri , Zihao Wang , Mst. Mahfuja Akter , Mirza Mohtashim Alam , Md Rashad Al Hasan Rony , Jens Lehmann , Steffen Staab

分类：自然语言处理 | 人工智能

2022-08-04

知识图，例如Wikidata，包括结构和文本知识，以表示知识。对于图形嵌入和语言模型的两种方式中的每种方法都可以学习预测新型结构知识的模式。很少有方法与模式结合学习和推断，而这些现有的方法只能部分利用结构和文本知识的相互作用。在我们的方法中，我们以单个方式的现有强烈表示为基础，并使用超复杂代数来表示（i），（i），单模式嵌入以及（ii），不同方式之间的相互作用及其互补的知识表示手段。更具体地说，我们建议4D超复合数的二脑和四个元素表示，以整合四个模态，即结构知识图形嵌入，单词级表示（例如\ word2vec，fastText，fastText），句子级表示（句子transformer）和文档级表示（句子级别）（句子级别）（句子级表示）（句子变压器，doc2vec）。我们的统一矢量表示通过汉密尔顿和二脑产物进行标记的边缘的合理性，从而对不同模态之间的成对相互作用进行建模。对标准基准数据集的广泛实验评估显示了我们两个新模型的优越性，除了稀疏的结构知识外，还可以提高链接预测任务中的性能。

translated by 谷歌翻译

COCOA: Cross Modality Contrastive Learning for Sensor Data

Shohreh Deldari , Hao Xue , Aaqib Saeed , Daniel V. Smith , Flora D. Salim

分类：计算机视觉 | 机器学习

2022-07-31

自我监督学习（SSL）是一个新的范式，用于学习判别性表示没有标记的数据，并且与受监督的对手相比，已经达到了可比甚至最新的结果。对比度学习（CL）是SSL中最著名的方法之一，试图学习一般性的信息表示数据。 CL方法主要是针对仅使用单个传感器模态的计算机视觉和自然语言处理应用程序开发的。但是，大多数普遍的计算应用程序都从各种不同的传感器模式中利用数据。虽然现有的CL方法仅限于从一个或两个数据源学习，但我们提出了可可（Crockoa）（交叉模态对比度学习），这是一种自我监督的模型，该模型采用新颖的目标函数来通过计算多功能器数据来学习质量表示形式不同的数据方式，并最大程度地减少了无关实例之间的相似性。我们评估可可对八个最近引入最先进的自我监督模型的有效性，以及五个公共数据集中的两个受监督的基线。我们表明，可可与所有其他方法相比，可可的分类表现出色。同样，可可比其他可用标记数据的十分之一的基线（包括完全监督的模型）的标签高得多。

translated by 谷歌翻译

Modeling Long-term Dependencies and Short-term Correlations in Patient Journey Data with Temporal Attention Networks for Health Prediction

Yuxi Liu , Zhenhao Zhang , Antonio Jimeno Yepes , Flora D. Salim

分类：机器学习 | 人工智能 | 自然语言处理

2022-07-13

基于电子健康记录（EHR）的健康预测建筑模型已成为一个活跃的研究领域。 EHR患者旅程数据由患者定期的临床事件/患者访问组成。大多数现有研究的重点是建模访问之间的长期依赖性，而无需明确考虑连续访问之间的短期相关性，在这种情况下，将不规则的时间间隔（并入为辅助信息）被送入健康预测模型中以捕获患者期间的潜在渐进模式。。我们提出了一个具有四个模块的新型深神经网络，以考虑各种变量对健康预测的贡献：i）堆叠的注意力模块在每个患者旅程中加强了临床事件中的深层语义，并产生访问嵌入，ii）短 - 术语时间关注模块模型在连续访问嵌入之间的短期相关性，同时捕获这些访问嵌入中时间间隔的影响，iii）长期时间关注模块模型的长期依赖模型，同时捕获时间间隔内的时间间隔的影响这些访问嵌入，iv），最后，耦合的注意模块适应了短期时间关注和长期时间注意模块的输出，以做出健康预测。对模拟III的实验结果表明，与现有的最新方法相比，我们的模型的预测准确性以及该方法的可解释性和鲁棒性。此外，我们发现建模短期相关性有助于局部先验的产生，从而改善了患者旅行的预测性建模。

translated by 谷歌翻译

How Robust is your Fair Model? Exploring the Robustness of Diverse Fairness Strategies

Edward Small , Wei Shao , Zeliang Zhang , Peihan Liu , Jeffrey Chan , Kacper Sokol , Flora Salim

分类：机器学习

2022-07-11

随着在高风险决策中引入机器学习，确保算法公平已成为越来越重要的问题。为此，已经提出了许多关于公平性的数学定义，并且已经开发了多种优化技术，所有这些都旨在最大化明确的公平概念。但是，公平解决方案取决于训练数据的质量，并且对噪声高度敏感。最近的研究表明，鲁棒性（模型在看不见的数据上表现良好的能力）在解决新问题时应使用的策略类型起着重要作用，因此，测量这些策略的鲁棒性已成为一种基本问题。因此，在这项工作中，我们提出了一个新标准，以衡量各种公平优化策略的鲁棒性 - \ textit {稳健性比率}。我们使用三种最受欢迎的公平策略在五个最受欢迎的公平定义方面，在五个基准标记公平数据集上进行了多次广泛的实验。我们的实验从经验上表明，依赖阈值优化的公平方法对所有评估的数据集中的噪声非常敏感，尽管大多数表现优于其他方法。这与其他两种方法相反，这对于低噪声方案而言不太公平，但对于高噪声方案而言更公平。据我们所知，我们是第一个定量评估公平优化策略的鲁棒性的人。这可以作为选择各种数据集的最合适的公平策略的指南。

translated by 谷歌翻译